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Abstract — Iterative channel estimation (ICE) usually exploits 
soft information of unknown data symbols as references to 
improve estimation performance. This paper investigates ICE for 
orthogonal frequency division multiplexing (OFDM) over wireless 
channels. The optimum ICE is derived in terms of maximum 
a posteriori (MAP) criterion, which can be solved using fixed- 
point iteration (FPI). Furthermore, the derived MAP ICE is 
closely related to the well-known expectation-maximization (EM) 
estimation. We also demonstrate that the MAP ICE converges 
within only one step when the signal-to-noise ratio (SNR) is large 
through analysis and simulation results. 

Index Terms — OFDM, iterative channel estimation, EM algo- 
rithm, performance bound. 

I. Introduction 

ORTHOGONAL frequency division multiplexing 
(OFDM) can provide high data rate transmission over 
wireless channels due to its robustness against frequency 
selectivity [1]. In an OFDM system, the frequency selective 
channel is converted into a group of flat fading channels, 
which can be easily compensated using a simple one-tap 
equalizer. As a result, the design of OFDM receiver is greatly 
simplified, and thus OFDM is widely used in many wireless 
systems [2]. 

With knowledge of frequency response of wireless channels 
at the receiver, coherent detection can be used in OFDM sys- 
tems, which has a much better performance than differential 
detection that requires no channel knowledge. Channel param- 
eters need to be estimated for coherent detection. Traditional 
approaches adopt one-round manner for channel estimation 
[3], [4], and, thus the estimation performance depends on the 
pattern of pilots [5]. 

Following the success of turbo code [6], the turbo principle 
has been widely applied in receiver design [7], and has been 
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first used in OFDM receiver in [8]. The turbo receiver enables 
channel estimation to be performed iteratively using the soft 
information of data symbols. With the soft information fed 
back by the detector, the unknown data symbols are replaced 
by the soft symbols which are the averages of possible 
constellation points. The regenerated soft symbols and the 
pilot symbols are used together to improve channel estimation 
for the next iteration. By taking into account the inaccuracy 
of regenerated soft symbols, an optimized iterative channel 
estimation has been developed in [9]. 

Since the regenerated soft symbols are not reliable when 
the signal-to-noise ratio (SNR) is low, different threshold 
based approaches have been proposed in [10]— [12] to im- 
prove the estimation performance in this case. The basic 
idea for the threshold approach is that a regenerated soft 
symbol is used for channel estimation only if its reliability is 
above a threshold. For a system with high order quadrature- 
amplitude-modulation (QAM), such as 64-QAM, the con- 
stellation contains many points, and thus regenerating soft 
symbols needs heavy computation. To reduce the complexity, 
a sub-constellation, such as 4-QAM, is used in [13], where 
the data symbol is regenerated using only several points with 
the highest probabilities. It is also possible to feed back the 
hard-decision symbols rather than the soft information [14]. 

In this paper, we propose an optimum iterative channel 
estimation (ICE) in terms of maximum a posteriori (MAP) 
criterion, which can be solved iteratively using fixed-point 
iteration (FPI) [15]. Instead of using arithmetic average [8]- 
[10], the derived optimum estimator suggests using a harmonic 
average over possible constellation points to regenerate soft 
symbols. We find a close connection between the proposed 
MAP ICE and the well-known expectation-maximization (EM) 
estimation [16], [17]. EM estimation includes two steps: an 
expectation- step (E-step) and a maximization- step (M-step). 
The EM estimation can find the maximum-likelihood (ML) 
estimation of unknown parameters through iterations between 
E-step and M-step when there is missing data. The EM 
estimation can be extended to MAP estimation by modifying 
the M-step with a given a priori information. Moreover, we 
demonstrate the monotonic convergence of the proposed MAP 
ICE, and show the proposed approach can converge within one 
step when the SNR is large, as confirmed by our analysis and 
the simulation. 

The rest of this paper is organized as follows. The system 
model is presented in Section II. Optimum ICE is derived 
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in Section III. Convergence behavior is discussed in Section 
IV. In Section V, the performance bound in terms of Bayesian 
Cramer-Rao Bound (CRB) [18] is analyzed. Simulation results 
are shown in Section VI. Finally, conclusions are drawn in 
Section VII. 

II. System Model 

In this section, we will describe the channel model and the 
OFDM system with soft-output detector. 



pilot 
symbol 


soft 
symbol 
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MMSE filter 



soft-output 
detector 



hard 
decision 



channel estimator 

Fig. 1 . A general structure for OFDM receiver with the ICE. The superscript, 
denotes the i-th iteration, and AW is a K X 1 vector, denoting the 
LLRs of K symbols. 



A. Channel Model 

The complex baseband representation of channel impulse 
response (OR) of a wireless channel can be described by 



h(r) = ^2aig(r - n), 



(1) 



where t; and on are the delay of the Z-th path and its complex 
amplitude, and g(r) is the impulse response of the equivalent 
system of the transmit and receive filter. Then, the channel 
frequency response (CFR) can be expressed as 



H(f) = G(f)^ ai e^' 



(2) 



where G(f) is the Fourier transform of g(t). With proper 
guard subcarriers in OFDM systems, G(f) can be regarded 
as a constant, Without loss of generality, assume G(f ) — 1 in 
the subsequent discussion. 

Under the uncorrelated scattering assumption [19], the com- 
plex amplitudes, a; 's, can be modeled as independent complex 
Gaussian random variables. Hence, the correlation function of 
CFR is 

r ( A/) 4 E{H* (/ + Af)H(f)} = £ of e~^ A ^' , (3) 

i 

where of = E(af ) is the power of the Z-th path. 

For an OFDM system with symbol duration, T, the subcar- 
rier spacing will be 1 /T according to the orthogonal condition. 
Given that there are K subcarriers, the CFR at the fc-th 
subcarrier can be expressed as 



H[k] = H{k/T). 



(4) 



Correspondingly, the correlation function for different subcar- 
riers is described by 



r[k] = r{k/T). 
Define H e C Kxl to be the CFR vector, as 
I H[0] 

H = 



(5) 



(6) 



V H[K - 1] 

The correlation matrix of CFR vector, H, is represented by 
/ r[0] • • • r[l - K] \ 
Rh — E (HH H ) = : •. : . (7) 

\r[K-l] ••• r[0] ) 

In a typical Rayleigh fading channel, the CFR vector, H, is 
a complex Gaussian distributed random vector. Hence, the 



probability density function (pdf) can be expressed as 1 [19] 



1 



-H H R„'H 



Tr^dc^n) 
where det(-) is the determinant of a matrix. 



(8) 



B. OFDM Receiver with Soft-Output Detector 

In an OFDM system with K subcarriers, the received signal, 
after removing the cyclic prefix (CP) and inverse fast Fourier 
transform (IFFT), can be represented as 



y[k] = H[k]x[k] + w[k], 



(9) 



where y[k] denotes the received signal at the fc-th subcarrier, 
x[k] is the transmitted symbol at the fc-th subcarrier, and 
w[k] is the additive white Gaussian noise with zero mean and 
E(K| 2 H^. 

The transmitted symbol, x[k], is either a pilot symbol that 
is known to the receiver or an unknown data symbol. The 
pilot symbols are used to estimate channel for the detection 
of the unknown data symbols. In this paper, we assume the 
transmitted symbols have constant modulus to facilitate our 
discussion. However, the proposed approach can be used to 
the cases of non-constant modulus symbols. 

A general structure of OFDM receiver with the ICE is 
shown in Fig. 1. To perform channel estimation iteratively, 
soft information about unknown data symbols are fed back by 
a soft-output detector. Different from a hard detector, the soft- 
output detector yields not only the decision results but also 
the reliability of the decisions. Without loss of generality, we 
assume that four quadrature amplitude modulation (4-QAM) 
is used 2 . 

Denote b[k, I], for I = 0 and 1, to be the l-th bit of the fc-fh 
QAM symbol. Then, the modulated symbol can be expressed 



as 



x[k] = 7=2 



(10) 



and the log-likelihood ratio (LLR) for b[k,l], yielded by the 
soft-output detector, can be represented as [8] 



P{6[fc,q = l|y,H} 
A[MJ - ln p{6M = 0|y,H}' 



(11) 



'Although Rh is rank deficient and thus not invertible, we can only focus 
on the linear independent components of H, and treat the other components 
as their deterministic linear combinations. This allows us to work with a 
compact notation in (8). Similar discussion about the rank deficient case can 
be found in Appendix A of [19]. 

2 Following similar procedure, the discussion here can also be used for other 
modulation scheme, e.g. 16-QAM. 
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where y e C Kxl is the received signal vector and can be 
expressed as 



y[o] 



y[K - 1] 



(12) 



With the LLR in (11), the conditional probabilities for 
b[k, I] = 1 and b[k, I] = 0, with I = 0 and 1, can be expressed, 
respectively, as 

A[fc,Z] 

P{&M = l|y,H}= i ; _ Ar , f1 , (13a) 



P{6[M] = 0|y,H} 



+ e A[fc,«] ' 
1 



(13b) 



I _|_ e A[fe,/] ' 

Therefore, the probability of x[k] = c mn is represented by 

P{x[k] = c mn |y, H} = P{b[k, 0] = m\y, H}P{fe[fc, 1] = n|y, H}, 

(14) 

where c mn denotes the constellation point corresponding to 

b[k,0] = m,b[k, 1] = n. 



III. Optimum Iterative Channel Estimation 

In this section, we will develop the optimum MAP estima- 
tion and discuss its implementation. 



A. MAP Estimation with FPI 

Given the received symbol vector, y, the optimum estima- 
tion of CFR in terms of the MAP criterion can be expressed 
as 

H M ap = argmax{lnp(H|y)} 

{H} 

= argmax{lnp(H) + lnp(y|H) - lnp(y)}. (15) 

{H} 

Therefore, the MAP estimation of CFR satisfies 

ainp(H) 01np(y|H) 

-8HT- + dH* = °' (16) 
where (•)* denotes the conjugate. In (16), the first term 
corresponds to a priori knowledge about channel distribution, 
and the second term corresponds to the MLestimation. 

Using the pdf in (8), it is easy to obtain the first term in 
(16) as 

91np(H) 



<9H* 



R H lfI 



(17) 



If the transmitted symbols are known, such as pilot symbols, 
it is also easy to derive an analytical solution for the second 
term in (16). However, since there are unknown data symbols, 
the conditional pdf, p(y|H), is unknown, and thus it is 
difficult to find a solution directly. One straightforward way 
is to express the conditional pdf, p(y|H), as the sum of 
p(y|H,x) using the complete-probability theorem. However, 
this approach makes the calculation complex and difficult for 
further manipulation. 



To simplify it, we first rewrite the second term in (16) as 
ainp(y|H) 1 9p(y|H) 



<9H* 



p(y|H) 8U* 



Q 



p(y|H) £t 



gp(y|H,x g )P{xJ 
<9H* 



(18) 



where the second equation follows from the complete- 
probability theorem. In (18), x 9 denotes the q-th possible 
symbol vector. For the QPSK modulation, we have Q = 4 K 
possible symbol vectors if all the symbols are unknown data 
symbols. Since 

dlnp(y|H,x g ) _ 1 dp(y|H,x 9 ) 



on* p(y|H,x g ) 

then (18) can be written as 



<9H* 



(19) 



01np(y|H) 



Q 

E 

9=1 

Q 



p(y|H,x g )P{xJ ainp(y|H,x g ) 



9H* p(y|H) <9H* 

d\np(y\n, Xq ) 



^P{x,|y,H} 

q=l 



(20) 



Note that P{x 9 |y, H} is the a posteriori probability of the q- 
th possible symbol vector, x g , and it depends on the detector 
we use. Given the q-th transmitted symbol vector, x g , and the 
CFR, H, it is easy to obtain 



p(yl H > x 9 ) 



l 



4-||y-X,H||| 



(21) 



where 1 1 • 1 1 2 denotes the 2-norm of a vector, and X 9 is a K x K 
diagonal matrix with the fc-th diagonal element from the fc-th 
element of vector x g . Assuming constant modulus modulation 
is used, e.g. QPSK, we have 

'"y^ '^V-H), (22, 

where 7 = al/a^ is the SNR. Substituting (22) into (20), we 
finally obtain 

01np(y|H) 



dH 



g2i_ T (rW-H) 



where X(H) is the soft symbol matrix given by 
X(H)= ^P{x g |y,H}X g ^ . 



(23) 



(24) 



Equation (24) implies that the soft symbol matrix is a function 
of the CFR. 



Combining (17) and (22), the MAP estimation of CFR, H, 
satisfies 

H = R H (Rh + X" 1 (H)y. (25) 

It is impossible to find a closed-form solution for H from 
(25). The FPI approach [15] can be used to solve this kind 
of equation. With an initial estimation, H( 0 ), FPI solves (25) 
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Fig. 2. The structural representation of the proposed MAP ICE. 



iteratively using the following relation 



H< i+1 > = R H ( Rh 



X 1 (H«)y, (26) 



for i = 0, 1,2, where HW denotes channel estimation 
after the i-th iteration. Usually, the initial estimation, H'°', can 
be obtained using traditional pilot-aided channel estimation. 

B. Harmonic Average 

Note that the average in (24) is carried out over all possible 
symbol vectors. This is different from the traditional ICE, 
where the average is performed over possible constellation 
points for each symbol. Actually, the vector-wise average in 
(24) can be also simplified into a symbol-wise average by 
noting that the same symbol may occur in different symbol 
vectors. We consider the QPSK modulation as an example. 
The analysis can also be used for non-constant modulus 
modulation such as 16-QAM. 

From (24), the fc-th element of the soft symbol matrix, 
X(HW), can be expressed as 
Q 



(Q/4 1 1 N 
X^j^M = Cmn,X 9i[fc] |y,H W |c"„^ 
g— 1 m— 0 n=0 . 



(27) 



where x ? j fe ] denotes the g-th possible vector excluding the k- 
th element. Using the complete-probability theorem [20], we 
have 

Q/4 

^P{x[fc] = c ro „,x g , [fc] |y,HW} = P{x[k] = c m „|y,H«}. 

9=1 

(28) 

Then, (27) can be simplified as 



*W = Y>Y>P{x[k]=c mn \y,H^}c^ n 

\m=0n=0 



(29) 



where the probability P{x[A:] = c mn \y, H^} is given in (14). 
As we can see, the proposed algorithm suggests a symbol-wise 
harmonic average for the data symbol, which is different from 
the arithmetic average widely used in the existing work. 

A structural representation of the proposed MAP ICE is 
shown in Fig. 2. The MAP ICE is exactly the same as the 
OFDM receiver in Fig. 1 except that a harmonic average 



is used for generating soft symbols. The soft symbols are 
regenerated using (29), and then sent to a minimum mean- 
square-error (MMSE) filter together with the pilot symbols, 
i.e. (26), to yield the channel estimation for the next iteration. 
For arithmetic average based approach [8], the soft symbol is 
regenerated as 



M 



x[k] = Y t Hx[k] = c i \y^}c i . 



(30) 



When the SNR is small, there is little a posteriori information 
can be extracted, and thus we may have P{ir[fc] = c,|y, H} w 
1/M and in turn x[k] ss 0. Apparently, a zero-approaching 
soft symbol will degrade the channel estimation performance 
through y[fc]/a;[fe]. On the other side, there is no such problem 
if we use harmonic average. Hence, the performance using 
harmonic average can be improved especially at small SNRs 
as demonstrated by our simulation in Section VI. 

In the above, we have presented the MAP ICE for QPSK 
modulation. For non-constant modulus modulation such as 16- 
QAM, we can use the approximation XX H w E(XX H ) = 
er^I where E(|x[fc] | 2 ) = a 2 x is now the average transmit power. 
Then, the proposed approach above can also be used for non- 
constant modulus modulation. 



C. Complexity 

Since the constellation is known to the receiver, the inverse 
of constellation point, c~^, in (29) can be thus obtained in 
advance. We need 4K complex multiplication to yield the 
soft symbols during each iteration. Note that the complexity 
for harmonic average in (29) is the same as the arithmetic 
average in [8]. Taking into account the MMSE filter in (26), 
we need K 2 + AK complex multiplications in total during 
each iteration. 



IV. Convergence Analysis 

In this section, we first discuss the relationship between 
the MAP ICE and EM estimation, and then investigate the 
convergence behavior. 



A. Relation with EM Estimation 

In the framework of EM estimation, the complete data, z, 
contains the received symbol vector, y, as the observation, 
and unknown data symbol vector, x, as the missing data. CFR 
vector, H, is unknown. The purpose is to estimate CFR in the 
presence of unknown data symbols. 

Following the definition of EM estimation, the E-step can 
be represented as 

Q 

Q(H|H«)=^P{x 9 |y,H«}lnp(y,x 9 |H). (31) 

9=1 

Using Bayesian theorem and noting that x g and H are 
independent, we can obtain 

lnp(y, x g |H) = Inp(y|H, x,) + In P{x g |H} 

= lnp(y|H,x 9 )+lnP{x g }. (32) 
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Since the second term in (32), i.e. lnP{x 9 }, is not used in 
the M-step, (31) is equivalent to 

Q 

Q(H|H«) - ]T P{x 9 |y, H«} lnp(y|H, x g ). (33) 

9=1 

Then, given a priori distribution, p(H), the M-step is 

H^ 1 ) =argmax{Q(H|H«)+p(H)) 
{H} L J 

= argmaxfo(H|H w )+p(H)} (34) 
{H} L J 

which can be easily solved by setting the derivative to be zero. 
Direct calculation yields the same result as (26). 

The analysis above shows that the proposed MAP ICE is 
equivalent to the EM estimation. Actually, the EM estimation 
has been widely adopted in OFDM receiver designs [21]-[23]. 
By carefully choosing the data set, the EM estimation can 
be used for channel estimation [21], [23] or signal detection 
[22]. The proposed MAP ICE takes into account the soft 
information fed back by the soft-output detector in the E- 
step. As a result, the ICE can be conducted with the a priori 
information of unknown data symbols, and thus the estima- 
tion performance can be further improved. This observation 
reveals the connection between the EM estimation and the 
widely adopted ICE in the turbo receiver [8], [9], where soft 
information is fed back to facilitate channel estimation. Instead 
of arithmetic average, the proposed MAP ICE shows that the 
best way to use soft information is the harmonic average. 

B. Convergence of the MAP ICE 

Since the MAP ICE can be explained in the framework of 
the EM estimation, the convergence of the proposed MAP ICE 
is guaranteed since the EM algorithm always converges with 
a proper initial value [16]. 

In view of (15), we define a cost function, /(H), as 



J(H)=Inp(H)+Inp(y|H). 
As we have proved in Appendix A, 

Z(H) - Z(H«) = |Q(H|H«) + lnp(H) 
Q(H«|H«) +lnp(H« 



(35) 



(36) 



Since (34) is a maximization operation, the estimation for next 
iteration, H^ +1 ^, guarantees that 

Q(H( i+1 )|H«)+lnp(H( i+1 )) > Q(H«|H«)+lnp(H«). 

(37) 



As a result, we have 



Z(H< i+1 )) > J(H«), 



(38) 



which indicates that the MAP ICE approach converges mono- 
tonically. 

C. Convergence Rate 

The iterative procedure in (26) implicitly defines a mapping 
H -> g(H) such that 

H< i+1 ) = g(H«). (39) 



Assuming that HW converges to the true CFR and g(H) is 
differentiable at H, which is true in our context, a Taylor 
expansion of (39) yields 

H (i+i) h = Jh • (HW 



H) 



|H«-H|||), (40) 



where o(-) denotes the higher order infinitesimal, and Jh is 
a K x K Jacobian matrix, defined by 



Jh = 



a 0g(H) 



<9H T 



- 1 dX \H)y 
<9H T ' 



(41) 



In view of (40), the convergence rate is determined by the 
spectrum radius, p(Jh) [23]. 

If there is error-correction coding over the transmitted 
OFDM symbols, it is difficult to calculate the derivative in 
(41). However, we believe that redundancy in coding will help 
to improve channel estimation. For an uncoded transmission, 
the LLRs for the fc-th symbol only depends on the received 
signal at the fc-th subcarrier and the CFR at the fc-th subcarrier. 
Then, the LLR in (11) can be simplified as [8] 

P{b[k,l} = l\y[k},H[k}} 



A[k,l] = In 



P{b[k,l} = 0\y[k},H[k}} 
Pw + Pn 



In 



In 



Poo 
Poi 



Poi 
Pii 



where p ri 



if I 

if I = : 

Poo + Pw 
is the conditional pdf given by 

Pmn =p(y[k]\H[k],x[k] = c mn ] 
1 — h-\v[k]-H[k]c m „\ 2 

= TT e " m 



0 



(42) 



(43) 



In (42), we have ignored the a priori information term since 
the channel estimator provides no a priori information to the 
detector. 

In (42), the LLRs for the fc-th symbol are independent of the 
CFRs at other subcarriers, so is the fc-th soft symbol, x _1 [fc]. 
Hence, the derivative term in (41) becomes a diagonal matrix, 



9X \u)y 
<9H T 



dH[0] 



V 



dx-^K- l]y[K- 1] 
dH[K - 1] 



/ 
(44) 

Using (29) and (42), after some manipulation, the inverse of 
the fc-th soft symbol becomes 

3 A[fc,0]\ / p A[fc,l] 



x- 1 



[fc] 



1 



\f2o x 



1 



1 



oA[fe,0] 



1 



; A[fe,l] + l 



(45) 



The derivative of x _1 [fc] with respect to H[k] can be con- 
ducted for the real part and imaginary part, separately. Taking 
the derivative of the real part, Re {x _1 [fc]}, for example, the 
modulus of the real part for the fc-th term of (44) can be 
expressed as 



| dH[k] 



V2y[k] 



dA[k,0] 



| <r x (e A [ fc .°] + e-MKO] + 2 ) dH[k] 



(46) 
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Fig. 3. Graphical explanation of the one step property. 



As we have shown in Appendix B, 



<9A[fc,0] 



dH[k] 



< 



2a x y*[k] 



(47) 



Furthermore, from Appendix C, when the SNR is large 
enough, we have 



|A[fc,0]|>2|tf[fc]| 2 7 . 



Using the relation in (47) and (48), we can derive a high SNR 
approximation of (46) as 



dH[k] 



< 



2V2\H{k]\ [ 



7 



e 2|ff[fc]|2 7+e -2|ff[fc]|2 7+2 



0. 

(49) 



The second approximation is obtained by noting that the linear 
term of the SNR in the numerator is much smaller than 
the exponential term of the SNR in the denominator as the 
increase of the SNR. Similar conclusion can be drawn for the 
imaginary part, Im {:r _1 [fc]}. 

From (49), the Jacobian matrix, Jh, approaches to zero 
when the SNR is high, and so is the spectrum radius, /o(Jh)- 
This means that the proposed approach has a superlinear 
convergence rate [23]. Actually, we can observe that the 
proposed algorithm can converge within one step from our 
simulation result in Section VI. 

Recalling that the proposed approach is based on FPI, the 
one step convergence property can be also observed in a 
graphical manner in Fig. 3. For a scalar FPI solution, e.g. 
x = g(x), as shown in the figure, the iterative procedure 
converges if and only if the absolute value of derivative at 
the true value, |g'(a;)|, is smaller than unity [15]. Otherwise, 
it will diverge. A special case is that the iterative procedure 
can converge within one step when g'{x) — 0. For a vector 
case, the derivative, g'(x), corresponds to the determinant of 
Jacobian matrix, i.e. det(Jn)- We have shown in the above 
that the Jacobian matrix approaches to zero when the SNR is 
large enough, and so is its determinant. Hence, we can expect 
that the proposed approach converges within one step. 

V. Performance Bound 

Since we have a priori information about the channel statis- 
tics, the performance bound can be evaluated using Bayesian 
CRB [18]. Different from the classical Bayesian CRB, the 
observation equation is partially known since the data symbols 
are unknown. 

From [18], the MSE of channel estimation can be bounded 
by the inverse of Bayesian information matrix (BIM), Jb, that 



is 

MSE> -Tr^B 1 ). 
The BIM can be expressed as 

Jb = Jp + Jf, 
where Jp is a priori information matrix, defined as 

~d 2 lnp(H) 



(50) 
(51) 



J P = E 



H 



<9H*5H T 

and, Jp is the Fisher information matrix (FIM), defined as 

"a 2 lnp(y|H) 



(52) 



J F = E 



H 



E 



y|H 



<9H*<9H T 



From (17), 



Jp — — R 



H 



(48) Using (23), we can further derive that 



9 2 lnp(y|H) 



= -7I + 7 



aX (H)y 



(53) 



(54) 



(55) 



<9H*<9H T ~ ' ' dH T ' 
In (55), the first term indicates exactly the performance bound 
of the ML estimation when the transmitted symbols are all 
known pilots. The second term shows the impact of unknown 
data symbols. As we can see, the impact caused by unknown 
data symbols depends on the derivative of detector perfor- 
mance at CFR. Due to unknown data symbols, the second 
term presents, and thus the performance bound increases. 

Further manipulation on the second term is difficult. How- 
ever, we can gain some insights if the SNR is high enough. In 
this case, the second term becomes a diagonal matrix. Similar 
to (49), we can obtain the real part for the fc-th diagonal 
element as 



5Re{x '(H^y 



<9H T 



2V2\H[ 



IV 



e 2|ff[fc]|2 7 + e -2|iJ[fc]| = 7+2 



J (fe,fe) 



0. 



(56) 



Different from (49), the numerator in (56) is a square term 
of the SNR. Even though, it is still much smaller than the 
exponential term of the SNR in the denominator. Similarly, 
the imaginary part is also zero. As a result, the second term 
in (55) approaches to zero for a high SNR. Substituting the 
relations (54) to (56) into (50), we can obtain the expression 
of the MSE for high SNR, that is 



K 



MSE >J2j 



fc=i 



(57) 



where is the fc-th eigenvalue of the CFR correlation matrix, 
Rh- 

When SNR is high, the channel estimation is accurate 
enough. In this case, further improvement on channel estima- 
tion performance does not necessarily improve the detector 
performance. The detector performance is now mainly deter- 
mined by the additive noise rather than the channel estimation 
error. In other words, we can say that the performance of 
detector is insensitive to the channel estimation error when the 
SNR is high. Mathematically, this insensitivity is equivalent to 
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Fig. 4. MSE performance versus SNR with five iterations. 



Fig. 5. MSE performance versus SNR with five iterations and one iteration. 



having a very small derivative at the CFR, as we have shown 
above. 

VI. Simulation Results 

In this section, we present simulation results to demonstrate 
the performance of the proposed approach. For simulation, 
we consider an OFDM system with 128 subcarriers, with 
four guard subcarriers on each side. The subcarrier spacing is 
15 KHz, corresponding to a symbol duration of about 0.067 
msec. The CP is assumed long enough so that no inter-symbol 
interference in presence. QPSK and 16-QAM are both used 
in the simulation. Each OFDM symbol consists of 1/15 pilot 
uniformly distributed over the subcarriers and 14/15 data 
symbols. The 3 GPP TU channel model is employed in the 
simulation [24], which has 20 paths and the maximum delay 
spread is 2.14 usee. 

Fig. 4 shows the MSE performance of the proposed ap- 
proach. The threshold control algorithm based on arithmetic 
average [10] is also included. Different from the typical arith- 
metic average based approach such as [8], threshold control 
approach can avoid zero-approaching soft symbols at small 
SNRs. As a result, the channel estimation performance can 
be improved at small SNRs. Although the threshold control 
based approach can avoid performance degradation at small 
SNR, it is just a sub-optimal approach. As a result, the pro- 
posed approach, based on MAP criterion, can outperform the 
threshold control based approach, especially at low-medium 
SNRs. As a benchmark, the CRB for high SNR, i.e. (57), is 
also included in the figure. Due to the presence of unknown 
data symbols, there is still a 2 — 3 dB performance gap for 
QPSK when the SNR is large. 

Fig. 5 shows the MSE of the proposed approach with 
different iteration numbers. From the figure, the first iteration 
yields a significant performance improvement. More iterations 
can only yield small performance improvement. This coincides 
with our convergence analysis in Section IV. When the SNR 
is large enough, the proposed approach can converge within 



-Q — SNR = 10 dB, QPSK 
-a— SNR = 1 0 dB, 1 6-QAM 

SNR = 10dB, CRB 

0 -SNR = 20dB, QPSK 
a - SNR = 20 dB, 16-QAM 
- - SNR = 20 dB, CRB 




2 3 
Number of iterations 



Fig. 6. MSE performance versus iteration numbers at 0 dB, 10 dB, and 20 
dB, respectively. 



one step, and therefore the performance is improved a lot for 
the first iteration. Since the algorithm has already converged, 
almost no improvement can be observed for the following 
iterations. 

Fig. 6 further shows the one step convergence property of 
the proposed approach. When SNR = 20 dB, the first iteration 
yields a significant performance improvement while there is 
almost no improvement for further iterations. This is because 
the MAP ICE has converged after the first iteration for a high 
SNR. When the SNR is low, such as SNR = 10 dB, the 
second iteration still yields a little performance improvement 
since the approach does not converge yet. 

Fig. 7 shows the symbol-error ratio (SER) versus SNR. The 
SER performances with perfect channel and estimated channel 
using threshold control based approach are also included for 
comparison. From the figure, the SER performance using 
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Fig. 7. SER performance versus SNR. 

MAP ICE outperforms the one using threshold control based 
approach. For QPSK modulation, there is an about 0.7 dB 
performance between the MAP ICE approach and threshold 
control based approach. For 16-QAM, the performance gap is 
more than 1 dB. Higher order modulation is more sensitive 
to the accuracy of channel estimation. As a result, improved 
channel estimation accuracy can lead to more significant SER 
performance increase. 

VII. Conclusions and Further Research 

A MAP ICE approach is proposed in this paper to improve 
the channel estimation performance. The proposed approach 
is derived based on the MAP criterion, and uses a harmonic 
average over the constellation for regenerating the data sym- 
bol, and is different from the existing ICE approaches where 
arithmetic average is usually used. We have found that the 
proposed approach is closely related to the well-known EM 
estimation. This relation is utilized to study the convergence of 
the proposed approach. From the analysis and the simulation 
results, the proposed approach can significantly improve the 
performance within only one step. 

Transmission over multiple OFDM symbols in time-varying 
channel will be also considered for further research. Moreover, 
the proposed approach will be extended to multiple-input 
multiple-output (MIMO) transmission. 

Appendix A 
Derivation of (36) 

To derive (36), we first rewrite the cost function in (35) as 

/(H) = lnp(y|H)+lnp(H) 

Q 

= ln^p(y,x g |H)+lnp(H) 

9=1 

= >»J i ^^PMy,H»l + >»,<H,. 

(A.l) 



Using Jensen's inequality for the fist term, we have 

/(H) > g P{x g |y, H«} In p ffi^ } + In„(H). 

(A.2) 

With Bayesian theorem, we have 

In = In P(y|H W ), (A.3) 

P{x,|y,H«} p(y,x g |H«) WI h 

and thus (A.2) can be rewritten as 
/(H) > f>{x 9 |y, H«} In -ggj^ p(y |H«) + lnp(H) 

Q 

= ]T P{x 9 |y, H«} lnp(y, x,|H) + lnp(y|H«)- 

9=1 
Q 

PKIy, H (i) } lnp(y, x,|H (i) ) + lnp(H) (A.4) 

1=1 

By appending a term, lnp(HW) — lnp(HW), to (A.4), it can 
be further rewritten as 
" Q 

/(H) > J2 P Kly> hW } ln My, x,|H) + lnp(H) 

-9=1 

" Q 

PKIy, H«} lnp(y, x,|HW) + lnp(H«) - 

-9=1 

lnp(y|H w ) +lnp(H w ) 
Q(H|HW)+Inp(H) 

Q(HW|HW)+Inp(HW) +/(H«). (A.5) 

Move /(H( J )) in (A.5) to the left-hand side, then (36) is 
immediately obtained. 



Appendix B 
Derivation of (47) 

From (42), we have 

(P10 +Pll) 5J W ai - (POO +^0!)^^ 



dA[k, 0] 




dH[k] 





(pio +Pn)(poo +Poi) 



By noting that 

dprr. 



= Pr. 



y*[k]c 



dH[k] - \ af, 
(B.2) can be rewritten as 



H*[k]-y 



(B.l) 
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Poo + P01 



Pio + Pll 



(B.3) 



Then, using the relation \z± + Zi\ < \z\ \ + \z2\, where z±, z 2 
are any complex numbers, we can derive 
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Appendix C 
Derivation of (48) 



Using the approximation [25] 



\n(e Sl H he*")« max ( 

the LLR, A[fc, 0], can be approximated as 



|A[fc,0]| = 



In 



Pio +Pu 



Poo + Poi 



In 



max{pi 0 ,pii} 



max{p 00 ,Poi} 



(CI) 



(C2) 



Without loss of generality, we further assume pio = 
max{pi 0 ,pii} and p 00 = max{p 00 ,p 0 i}. Then, (C.2) can 
be rewritten as 



|A[fc,0]|> 



ln^ 

Poo 



= —\y*[k]H[k]c w + y{k]H*[k]c\ 
y*[k]H[k}coo~y[k}H*[k}c* 00 \. 
Substituting (10) into (C.3), we have 

\A[k,0}\>^^\Re{y*[k]H[k]}\. 



(C3) 



(C4) 



At high SNR, we ignore the noise term in the received signal, 
y[k], then 

\Re{y*[k}H[k}}\ « \H[k}\ 2 ■ \Re{x[k}}\ = \H[k}\ 2 ■ ^. 

(C.5) 



By substituting (C.5) into (C.4), we have 
|A[fc,0]|>2| J ff[fc]| 2 7 . 
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